Testing goodness-of-fit and conditional independence with approximate co-sufficient sampling
نویسندگان
چکیده
Goodness-of-fit (GoF) testing is ubiquitous in statistics, with direct ties to model selection, confidence interval construction, conditional independence testing, and multiple just name a few applications. While the GoF of simple (point) null hypothesis provides an analyst great flexibility choice test statistic while still ensuring validity, most tests for composite hypotheses are far more constrained, as must have tractable distribution over entire space. A notable exception co-sufficient sampling (CSS): resampling data on sufficient guarantees valid using any chooses. But CSS requires compact (in information-theoretic sense) statistic, which only holds very limited class models; even logistic regression, powerless. In this paper, we leverage concept approximate sufficiency generalize essentially parametric asymptotically efficient estimator; call our extension “approximate CSS” (aCSS) testing. We quantify finite-sample Type I error inflation aCSS show that it vanishing under standard maximum likelihood asymptotics, statistic. apply proposed procedure both theoretically simulation number models interest demonstrate its power.
منابع مشابه
On Testing Independence and Goodness-of-fit in Linear Models
We consider a linear regression model and propose an omnibus test to simultaneously check the assumption of independence between the error and the predictor variables, and the goodness-of-fit of the parametric model. Our approach is based on testing for independence between the residual and the predictor using the recently developed Hilbert-Schmidt independence criterion, see [GFT+08]. The prop...
متن کاملDifferentially Private Chi-Squared Hypothesis Testing: Goodness of Fit and Independence Testing
Hypothesis testing is a useful statistical tool in determining whether a given model should be rejected based on a sample from the population. Sample data may contain sensitive information about individuals, such as medical information. Thus it is important to design statistical tests that guarantee the privacy of subjects in the data. In this work, we study hypothesis testing subject to differ...
متن کاملGoodness-of-fit testing under long memory
In this talk we shall discuss the problem of fitting a distribution function to the marginal distribution of a long memory process. It is observed that unlike in the i.i.d. set up, classical tests based on empirical process are relatively easy to implement. More importantly, we discuss fitting the marginal distribution of the error process in location, scale and linear regression models. An int...
متن کاملBayes Factors for Goodness of Fit Testing
We propose the use of the generalized fractional Bayes factor for testing fit in multinomial models. This is a non-asymptotic method that can be used to quantify the evidence for or against a sub-model. We give expressions for the generalized fractional Bayes factor and we study its properties. In particular, we show that the generalized fractional Bayes factor has better properties than the fr...
متن کاملGoodness-of-fit and confidence intervals of approximate models
To test whether the model fits the data well, a goodness-of-fit (GOF) test can be used. The chi-square GOF test is often used to test the null hypothesis that a function describes the mean of the data well. The null hypothesis with this test is rejected too often, however, because the nominal significance level (usually 0.05) is exceeded. Alternatively, the level of Hotelling’s test is accurate...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Annals of Statistics
سال: 2022
ISSN: ['0090-5364', '2168-8966']
DOI: https://doi.org/10.1214/22-aos2187